Semi-Supervised Knowledge Amalgamation for Sequence Classification
نویسندگان
چکیده
Sequence classification is essential for domains from medical diagnosis to online advertising. In these settings, data are typically proprietary, and annotations expensive acquire. Often times, so few available that training a robust model scratch impractical. Recently, knowledge amalgamation (KA) has emerged as promising strategy models without this hard-to-come-by labeled dataset. To achieve this, KA methods combine the of multiple pre-trained teacher (trained on different tasks proprietary datasets) into one student becomes an expert union all teachers’ classes. However, we demonstrate state-of-the-art solutions fail in presence overconfident teachers, which make confident but incorrect predictions instances classes upon they were not trained. Additionally, to-date no work explored sequence models. Therefore, propose then solve open problem semi-supervised (SKA). Our SKA approach first learns estimate how trustworthy each given instance, rescales predicted probabilities teachers supervise model. solution overcomes through careful use very small amount instances. We beats eight alternatives four real-world datasets by average 15% accuracy with little 2% being annotated.
منابع مشابه
Semi-Supervised Sequence Classification with HMMs
Using unlabeled data to help supervised learning has become an increasingly attractive methodology and proven to be effective in many applications. This paper applies semi-supervised classification algorithms, based on hidden Markov models (HMMs), to classify sequences. For model-based classification, semisupervised learning amounts to using both labeled and unlabeled data to train model parame...
متن کاملSemi-Stacking for Semi-supervised Sentiment Classification
In this paper, we address semi-supervised sentiment learning via semi-stacking, which integrates two or more semi-supervised learning algorithms from an ensemble learning perspective. Specifically, we apply metalearning to predict the unlabeled data given the outputs from the member algorithms and propose N-fold cross validation to guarantee a suitable size of the data for training the meta-cla...
متن کاملDetecting Concept Drift in Data Stream Using Semi-Supervised Classification
Data stream is a sequence of data generated from various information sources at a high speed and high volume. Classifying data streams faces the three challenges of unlimited length, online processing, and concept drift. In related research, to meet the challenge of unlimited stream length, commonly the stream is divided into fixed size windows or gradual forgetting is used. Concept drift refer...
متن کاملSemi-supervised Sequence Learning
We present two approaches that use unlabeled data to improve sequence learning with recurrent networks. The first approach is to predict what comes next in a sequence, which is a conventional language model in natural language processing. The second approach is to use a sequence autoencoder, which reads the input sequence into a vector and predicts the input sequence again. These two algorithms...
متن کاملOn Semi-Supervised Classification
A graph-based prior is proposed for parametric semi-supervised classification. The prior utilizes both labelled and unlabelled data; it also integrates features from multiple views of a given sample (e.g., multiple sensors), thus implementing a Bayesian form of co-training. An EM algorithm for training the classifier automatically adjusts the tradeoff between the contributions of: (a) the label...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i11.17185